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Abstract 

Encodings, that is, injective functions from words to words, have been 
studied extensively in several settings. In computability theory the notion 
of encoding is crucial for defining computability on arbitrary domains, as 
well as for comparing the power of models of computation. In language 
theory much attention has been devoted to regularity preserving functions. 

A natural question arising in these contexts is: Is there a bijective 
encoding such that its image function preserves regularity of languages, 
but its preimage function does not? Our main result answers this question 
in the affirmative: For every countable class L of languages there exists a 
bijective encoding / such that for every language L £ £ its image f[L] is 
regular. 

Our construction of such encodings has several noteworthy conse¬ 
quences. Firstly, anomalies arise when models of computation are com¬ 
pared with respect to a known concept of implementation that is based on 
encodings which are not required to be computable: Every countable de¬ 
cision model can be implemented, in this sense, by finite-state automata, 
even via bijective encodings. Hence deterministic finite-state automata 
would be equally powerful as Turing-machine deciders. 

A second consequence concerns the recognizability of sets of natural 
numbers via number representations and finite automata. A set of num¬ 
bers is said to be recognizable with respect to a representation if an au¬ 
tomaton accepts the language of representations. Our result entails that 
there exists a number representation with respect to which every recursive 
set is recognizable. 


1 Introduction 

In order to define computability of number-theoretic functions through compu¬ 
tational models that operate on strings of symbols from an alphabet E (rather 
than defining computability directly via recursion schemes) one usually employs 
(number) representations, that is, injective functions r : N —> E*. A function 
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/ : N —> N is called r-computable {computable by a Turing machine using repre¬ 
sentation r) if there exists a Turing-computable function : E* —>■ S* such that 
(por = ro f. For representations r that are informally computable (i.e., there is 
a machine-implementable algorithm that always terminates, and computes r), 
it can be argued on the basis of Church’s thesis (similar as e.g. in [20, p. 28]) 
that r-computability does not depend on the specific choice of r, and coincides 
with partial recursiveness. 

Shapiro [25] studied the influence that (unrestricted) bijective representa¬ 
tions r have on the notion of r-computability. He found that the only functions 
that are r-computable with respect to all bijective representations r are the al¬ 
most constant and almost identity functions; and that there are functions that 
are not r-computable for any representation r. Furthermore, he defines ‘accept¬ 
able’ number representations: a bijective representation r is called ‘acceptable’ 
if the successor function lifted to the r-coded natural numbers is Turing com¬ 
putable. He goes on to show that, a representation r is acceptable, if and only 
if r-computability coincides with partial recursiveness. 

In this paper we focus on the notion of computability by finite automata of 
sets of natural numbers. In particular, we investigate how number representa¬ 
tions determine the sets of natural numbers that are computable by finite-state 
automata. Such sets are called ‘recognizable’: a set S' C N is called r-recogniz- 
able {recognizable with respect to representation r), if there is a finite automaton 
that for all n S N decides membership of n in S when r{n) is given to it as 
input. 

We are interested in comparing representations r with respect to their com¬ 
putational power as embodied by the r-recognizable sets. This idea gives rise to 
a hierarchy via a subsumption preorder between representations: ri : N —>■ H* 
subsumes r 2 : N ^ H* if all r 2 -recognizable sets are also ri-recognizable. There 
are several natural questions concerning this preorder; to name a few: 

(i) When does a number representation subsume another? 

(ii) Is the hierarchy proper: do there exist representations ri and r 2 such that 
ri subsumes r 2 , but not vice versa? 

(iii) Is there a representation that subsumes all others? 

(iv) Is every (injective) number representation subsumed by a bijective number 
representation? 

(v) What classes C C p(N) of sets of natural numbers are recognizable with 
respect to a number representation? 

As our computational devices are finite automata, all of these questions boil 
down to problems in language theory. In particular the comparison of number 
representations is intimately connected with encodings, injective mappings from 
words to words, that have the property that their image function preserves 
regularity of languages. For bijective number representations / : N —A* and 
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g : N ^ B*, we have that / subsumes g if and only if the set function 

ifo9-^)[-] 

preserves regularity of languages; here we use the notation h[_] to denote the im¬ 
age function of a function h. Regularity preserving functions play an important 
role in different areas of computer science, and have been studied extensively. 
An important result in this area is the work [16, 14] of Pin and Silva, providing 
a characterization of regularity preservation of preimage functions in terms of 
uniformly continuous maps on the profinite topology. 

A natural question that presents itself then is the following: 

Are there bijective functions / : E* —> S* whose image function 
/[_] preserves regularity of languages, but whose preimage function 
/“^[_] does not? 

For bijective word functions we experienced this to be a very challenging ques¬ 
tion, which to the best of our knowledge, has remained unanswered in the liter¬ 
ature. Using the results of [16, 14], it can equivalently be formulated as follows: 


Are there bijective functions / : E* —>■ E* such that f is uniformly 
continuous, but f~^ is not uniformly continuous in the profinite 
topology? 

Concerning recognizable sets and the hierarchy of number representations, the 
question translates to: 

Are there bijective number representations f and g such that f strictly 
subsumes g? 

If this were not the case, subsumption would imply equivalence for bijective 
number representations, and the hierarchy would collapse. 

Our main result (Theorem 8), which allows us to answer all of the above 
questions, is the following: 

Main Theorem. For every countable class C C p(E*) of languages over a 
finite alphabet E, and for every alphabet F with |F| > 2, there exists a bijective 
encoding / : E* ^ F* such that for every language L & C its image f[L] is 
regular. 

With respect to computability theory and recognizable sets of natural num¬ 
bers, this result can be restated as follows: 

For every countable decision model Ai C p(N), there exists a bi¬ 
jective representation / : N —>■ E* such that every set M € A4 is 
f-recognizable. 
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As a direct consequence, when allowing for arbitrary bijective number represen¬ 
tations, we find the unsought: 

Finite automata are as strong as Turing-machine deciders. (/) 

That is, there is a bijective representation such that finite automata can recog¬ 
nize any computable set of natural numbers. 

Our result also has consequences in the context of the work by Boker and 
Dershowitz on comparing the power of computational models, as described be¬ 
low. Models over different domains are typically compared with the help of 
encodings that translate between different number representations. In order to 
prevent encodings from changing the nature of the problem, they are usually 
required to be ‘informally algorithmic’, ‘informally computable’, or ‘effective’ 
(see e.g. [20, p.27]). However, the latter concepts are rather vague, and in any 
case non-mathematical. Therefore they are unsatisfactory from the viewpoint 
of a rigorous conceptual analysis. 

In the formal approach for comparing models of computation proposed by 
Boker and Dershowitz in [3, 5, 4], encodings are merely required to be injective. 
On the basis of this stipulation, a computational model AI 2 is defined to be ‘at 
least as powerful as’ A4i, denoted by A4i < M. 2 , if there exists an encoding 
p : E* —>■ S 2 with the property that for every function / computed by there 
is a function g computed by A42 such that the following diagram commutes: 
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(In order to highlight the encoding used. Mi <p M 2 is written.) Although 
encodings are not required to be (informally) computable, this approach works 
quite well in practice. 

However, in analogy to what we found for recognizability, one runs into the 
anomaly (/) again, namely when comparing the power of decision models with 
the preorder <. Our main result implies C < DFA for every countable class of 
decision problems C, where DFA denotes the class of deterministic finite-state 
automata. Even stronger, it follows that there is a bijective encoding p such 
that C DFA. As a consequence we obtain that TMD <p DFA holds for the 
class TMD of Turing-machine deciders, and a bijective encoding p. 


Further Related Work 

For a general introduction to automata and regular languages we refer to [21, 8]. 
We briefly mention related work with respect to regularity preserving functions 
apart from work [16, 14] of Pin and Silva that we have already discussed above. 
The works [26, 9, 13, 22, 10, 23, 12] investigate regularity preserving relations; 
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in particular, [23] provides a characterization of prefix-removals that preserve 
regularity. In [15], Pin and Sakarovitch study operations and transductions that 
preserve regularity. In [1 1] , Kozen gives a characterization of word functions over 
a one-letter alphabet whose preimage function preserves regularity of languages. 
The paper [2] by Berstel, Boasson, Carton, Petazzoni and Pin characterizes 
language preserving ‘filters’; a filter is a set F C N used to delete letters from 
words of the language as indexed by elements of the filter. 

2 Preliminaries 

We use standard terminology and notation, see, e.g., [1]. Let E be an alphabet, 
i.e., a finite non-empty set of symbols. We denote by S* the set of all finite 
words over E, and by e the empty word. The set of infinite sequences over E is 
E^ = {tj j cr : N ^ E} with N = {0,1,2,...}, the set of natural numbers. 

A deterministic finite-state automaton (DFA) is a tuple A = (Q,S,S,qo,F} 
consisting of a finite set of states Q, an input alphabet E, a transition function S : 
Q X E —>• Q, an initial state go € Q, and a set F C Q of accepting states. The 
transition function S is extended to <5* : Q x E* —>■ Q by 

5*{q,e) = q 6*(q,aw) = 6*{6{q,a),w), 

for all states q G Q, letters a G E and words w G E*. We will write just 6 for 
6*. A word ic G E* is accepted by A if S{qo,w), the state reached after reading 
w, is an accepting state. We write Lang(A) for the language accepted by the 
automaton A, i.e., Lang(A) = {w G E* j S{qo,w) G F}. 

A DFA with output (DFAO) is a tuple {Q, E, <5, qo, A, A), with the first four 
components as in the definition of a DFA, but with, instead of a set of accepting 
states, an output alphabet A, and an output function X : Q ^ A. A DFAO 
A = {Q, E, 6, qo, A, A) realizes a function mapping words over E to letters in A; 
we denote this function also by A, that is, we define A : E* —> A by 

A{w) = X{6{qo,w)) for all lu G E*. 

A DFA {Q, E, d, qo, F) can thus be viewed as a DFAO {Q, E, 6, qo, {0,1}, xf) 
where xf is the characteristic function of F; instead of a state being accepting 
or not, it has output 1 or 0 respectively. 

Two automata A and A! over the same input alphabet E are equivalent on 
a set A C E* if A{x) = A'{x) for all x G X. We use this functional notation 
also for a DFA A, stipulating: A{x) = 1 {A{x) = 0) iff A accepts x (A does not 
accept x). 

A partition F of a set 1/ is a family of sets P C p{U) such that 0 ^ P, 
A= U, and for all A,B G P with A ^ B, Af] B = 0. 

The pigeon hole principle (PHP) states that if n pigeons are put into m 
pigeonholes with n > m, then at least one pigeonhole contains more than one 
pigeon. PHP for infinite sets is that if infinitely many pigeons are put into 
finitely many holes, then one hole must contain infinitely many pigeons. 
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Let A, B, X, and Y be sets, with X C A and Y C B. For a function A ^ B 
we write f[X] for the image of X under /, that is, f[X] = {f{x) \ x S X}. 
Likewise, we write f~^[Y] for the preimage of Y under /, i.e., f~^[Y] = {x \ 


f{x) e y}. 


A function F : p(E*) —>■ p(r*) preserves regularity if F(L) is regular when¬ 
ever L is a regular language. 


3 Main Results 


In this section, we prove our main results. We first work towards Theorem 8 
stating that for every countable class £ of languages there exists a bijective 
encoding / : E* —>• F* such that f[L] is regular for every language L £ C. The 
proof proceeds in two stages: we first prove the existence of injective encodings 
(Lemma 2), and then strengthen this result to bijective functions (Lemma 6). 
From Theorem 8 we then obtain the existence of bijective functions that are 
regularity preserving but not regularity reflecting. Corollary 9. 

For injective encodings / we cannot require the images f[L] to be regular; 
instead we require f[L] to be recognizable among /[E*]. This leads to the notion 
of ‘relatively regular in’. 

Definition 1. Let L,M C E* be formal languages over the alphabet E, with 
L C M. Then L is relatively regular in M if there exists a regular language R 
such that L — RCiM. 

So a regular language S' C E* is relatively regular in E*. 

Lemma 2. Let E and F be finite alphabets, with |r| > 2. Let C C p(E*) he a 
countable set of formal languages. There exists an injective function / : E* ^ F* 
such that for every L £ C, f[L] is relatively regular in /[E*]. 

Proof. Let Li, L 2 , L^,... be an enumeration of £, and let vo,vi,V 2 ,. ■. be an 
enumeration of E*. For i > 1, we write Xi '■ {O;!} for characteristic 

function of Li, that is. 



for all V £ Y*. 


Without loss of generality we assume F = {0,1,..., fc — 1} for some k > 2. 
Define the function / : E* —>■ F* by 


fM = Xl(Wn) X2iVn) ' ' ' Xn{Vn) , 


for all n € N. 


For every i = 1, 2,..., we construct a DFAO Ai and show that f[Li] = Lang(Ai)n 
/[E*] witnessing that f[Li] is relatively regular in /[E*], as required. 

Fix an arbitrary integer i > 1. Define Ai = {Q,T,S,0,X) where Q = 
{0,1,... ,i — 1} U {io,ii}, F = {0,1,..., /c — 1}, 6 : Q x T —>• Q is defined 
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by 


5[q,a) =q + l 

for all q G {0,1,..., J — 2} and a G F, 

6{i - 1,0) = io 


S{i — 1, a) = ii 

for all a G F \ {0}, 

Siij,a) = ij 

for all j G {0,1}, 


and A : Q —>■ r is defined, for all g G {0,1,..., f — 1}, by 

A(g)=XiK) A(fo)=0 A(ii) = l. 

The automaton Ai is depicted in Figure 1. 


0/Xi(i'o) 





u 


Figure 1: The DFAO Ai used in Lemma 2. A transition labelled F stands for 
transitions labelled a for every a G F. 

We show that f[Li] = Lang(Ai) fl /[S*]; equivalently, for all v G E*, 

V e Li f{v) G Lang(Ai). 

Let V G Y,*, so V = Vn for some n G N. Note that |/(u„)| = n. Hence, if n < i, 
the automaton Ai is in state n after having read the word f(vn) and outputs 
Xi(vn)- If ’T' > L then after having read f{vn), the automaton is in state ij, 
where j is the f-th letter of f{vn), that is, j = Xi(^^n)- In both cases we get 
Vn G Li if and only if /(n„) G Lang(Ai). □ 

For lifting the result of Lemma 2 from injective to bijective encodings, we 
need some preliminary notions and results. 

Definition 3. Let 17 be a set and /, C C H. The set C is attracted to I ii C C I 
whenever C fl / is finite. For a partition E of U, we say that E is attracted to 
/, when, for every C G E, C is attracted to I. 

Equivalently, E is attracted to I if for every CgE, A C\I=^0 then C Hi 
is infinite. 

Lemma 4. Let E = {Co, Ci ..., Cn-i} be a finite partition of E* with Ci C E* 
a regular set for every i G {0,1,..., n — 1}. Let ICY* and assume that E is 
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attracted to I. For every DFA A there exists a DFA A' such that A' is equivalent 
to A on I and the refined partition 

E'= {Cl^\i€ {0,1,...,n-l},j G {0,1}} 
is attracted to I, where, for i G {0,1,... ,n — 1}, j G {0,1} , 

CE =Qn{uGE* I A'(u) = J} . 

Proof. We start with A' = A, and repeatedly adapt A' (and therewith E') until 
E' is attracted to I, in such a way that equivalence with A is upheld. 

Assume that E' is not attracted to I. Then there exist i G {0,1,... ,n — 1} 
and j G {0,1} such that C-j \ I ^ 0 but C{ j fl / is finite. Without loss of 
generality, assume that j = 0. Since Ci = it follows that Ci\I ^ 0 

and hence Ci 0 I is infinite by assumption. By the pigeonhole principle (for 
infinite sets) it follows that C' ^ fl / is infinite. Since Ci is a regular set and A' is 
a finite automaton, it follows that C{ g is regular as it is the intersection of two 
regular sets. As C' gfl / is finite, also C' q\ / is regular. As a consequence we can 
change the finite automaton A' to accept the terms in C'g \ I (and otherwise 
to behave as before). This adaptation preserves equivalence, and we now have 
that C-j is attracted to I for j = 0,1, since, after the adaptation, C- qCiI = C' g 
and C}i n / is infinite. We repeat the procedure until C{ j is attracted to I for 
every f G {0, 1, ..., n — 1} and j G {0, 1} . □ 

The following lemma. Lemma 6, is a key contribution of our paper. It states 
that every injection / : A —>■ T* (with A some countably infinite set) can be 
transformed into a bijection g : A ^ T* such that, for all EGA, g[L] is a 
regular language whenever f[L\ is relatively regular in the image f[A]. Before 
proving the lemma, we sketch the construction. We construct g as the limit of 
a sequence of adaptations of /. Roughly speaking, we make / ‘more bijective’ 
in every step. We let 

- uo, ui, U 2 ,... be an enumeration of A, and 

- wo,wi,W 2 ,... be an enumeration of T*. 

Figure 2 sketches an injective function /; an arrow from Vi to Wj indicates that 

f(Vi) = Wj. 


^0 , 

Vi 

V2 , 


Vi ■■■ 

/ 

Wo 

Wi 

W2 

‘ W3 

W4 

WC* W7 Ws ■ ■ ■ 


Figure 2: Starting situation: the function /. 

The idea is that we change the target of arrows such that all words Wi for * G N 
become part of the image. For every natural number n = 0,1,2,... we will pick 
(while avoiding repetitions) a word (for kn G N) from the input domain and 
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then adapt / by stipulating Vk^ Wn- Then, in the limit, every word w G T* 
will be in the image. (In order for the limit of the adaptation to be a function, 
we also need to guarantee that every Vk„ will be picked precisely once.) 

The crucial point of the construction is the following: when changing arrows, 
we need to ensure that 

(*) the limit of the process preserves relative regularity. 

Note that for bijective functions 5 : A —>■ T* we have that g\L] is relatively 
regular in the image g[A\ = T* if and only if g[L\ is regular. Thus if we can 
ensure (*), the resulting bijective function g will have the desired property. 

How to pick the for ensuring (*)? Let Hq, Ai, A 2 ,... be an enumeration 
of all finite automata over the alphabet T. We write u v if for every i < n, 
the automaton Ai accepts the word u if and only if it accepts the word v. We 
then pick for every natural number n S N, a word Vk^ such that f{vk^) Wn 
(and the word Vk^ has not been picked before). In other words, we pick Vk^ 
such that the first n automata Aq, Ai,..., An-i cannot distinguish f{vk^) from 
the image Wn after the adaptation 1 —>■ Wn- This choice guarantees that 
every automaton Ai {i € N) is only affected by a finite number of adaptations, 
namely the first i transformation steps. For every further adaptation (j > i), the 
behavior of the automaton Ai is taken into account for the choice of Vk ^, and as 
a consequence the modification Vk^ 1 —t Wj preserves the acceptance behavior of 
Ai. Then for the limit g of the adaptation process we have for almost all n G N 
that Ai accepts /(u„) if and only if Ai accepts g{vn). In order to guarantee that 
every Vi will be picked eventually and that the adaptation preserves injectivity, 
we pick among the suitable candidates for Vk„ the one which appears first in 
the enumeration wo,wi,.... 

Remark 5. There is a caveat here that we will ignore in this sketch of the 
construction. We actually need to make sure that a word Vk„ with these prop¬ 
erties exist. To ensure this, the equivalence classes with respect to rnust be 
attracted to the image of /. This is in general not the case, but we can employ 
Lemma 4 to adapt the automata outside of the image of /. We refer to the 
proof of Lemma 6 for the details. 

We explain this process at the example of the function / given in Figure 2. 
For the first step n = 0, we want to adapt / such that wq becomes part of the 
image of /. Note that the relation ^0 relates all words of F*. As a consequence 
we can pick any word Vkg. We choose Vk^ = vi since the image f{vi) appears 
first in the sequence wq,wi, ..and we adapt the function / by ui 1 —>• rco- 


Vo V^ V9. Vo Va 



wq wi W2 Wo wa Wo rce w^ wg 
0 110 10 0 10 


Figure 3: Result of the first adaptation of the function /. 
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The result of this first adaptation is shown in Figure 3. For the second step 
(n = 1) we want that wi becomes part of the image. Now relates words 
that have equal behavior with respect to acceptance by the automaton Aq. The 
numbers 0 and 1 below the words Wi in Figure 3 indicate whether Aq accepts the 
word Wi (1) or not (0). The word wi is accepted by Aq and likewise are /(W 2 ) 
and /(W 4 ). Among these candidates, we choose Vk^ = Vi since /(vi) appears 
first in the sequence wi,W 2 , ■ ■ ■■ We modify the function / by setting V 4 wi. 


^0 , 


'^2 _ 

V 3 

JJ 4 


/ 





Wo 

Wi 

W2 

W3 

W4 

Wo 

01 

10 

10 

01 

11 

00 


10 



Figure 4: Result of the second adaptation of the function /. 

The result of the second adaptation is shown in Figure 4. Now n = 2 and 
^2 relates words that have equal acceptance behavior with respect to automata 
Aq and Ai. To this end, we now write Ao(iCi) Ai(wi) below each word Wi 
in Figure 4. The word W 2 is accepted by Aq but rejected by Ai. The only 
(displayed) candidate for Vk 2 exhibiting the same behavior is f{v 2 )- The result 
of this third adaptation is shown in Figure 5. 



Wq Wi W2 Wq W4 Wq Wq Wy Ws 


Figure 5: Result of the third adaptation of the function /. 

This process continues for every n S N, and the limit of this process is 
a bijective function g with the desired properties. The construction is made 
precise in the proof of Lemma 6 . 

Lemma 6 . Let A be a countably infinite set. For every injection f : A ^ T* 
there exists a bijection g : A ^ T* such that for all L C A, if f[L] is relatively 
regular in the image f[A], then g[L] is a regular language. 

Proof. Let / : A —>■ F* be an injective function. Let 

- ui, V 2 ,... be an enumeration of the set A, and let 

- Wq, wi, W 2 j ■ • ■ be an enumeration of the set F*, and let 

- Aq, Ai, A2,... be an enumeration of all finite-state automata over F. 

We abstract away from the domain by defining c : N —>■ F* by 

c(z) = f{vi) for all i € N. 
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In this proof, will construct a bijective function c? : N —>■ F* such that for all 
S' C N, if c[S] is relatively regular in the image c[N], then d[S] is a regular 
language. 

Then we can define the function (/ : A —>■ F* by 

g{wi) = d{i) for all t S N. 

The bijectivity of g follows immediately from bijectivity of d, and for every 
L C A with f[L] is relatively regular in f[A], we have: let S C N such that 
L = {vi \ i £ S}, then c[S] = f{L) is relatively regular in c[N] = f[A], and 
hence d[S] = g[L] is regular. As a consequence, it suffices to construct a function 
d with the properties above. 

For every n G N we will define a finite-state automaton A'^ with properties 
described below. For each n G N, we define the equivalence relation C F* xF* 

by 

u v! Vt G {0,1,..., n — 1}. A'('u) = A[{u') , 

for all u, u' G F*. So relates words that are not distinguished by the automata 
Ag, A'l,..., A'j^_i. The relation gives rise to a partition 

En = { N^„ I u G F* } 

where = {u' G F* | u u'} is the equivalence class of u with respect to 

We are ready to carry out the central construction. We first describe what 
objects we will define and their main properties. Then we give specifications 
of these objects, and afterwards we show that the objects are well-defined and 
that the properties hold. 

We are going to define, by induction on n G N, 

(i) a natural number G Kn , where 

Aji — M \ Kn Kn — {^0? 5 • ■ • 5 ^n—l} ? 

such that 

c(fc„) '^n Wn ■ (1) 

(ii) a bijective map (i„ : N ^ , where 

In — c[A^] U Wn ffn — {'^0 , 5 ■ • ■ 5 '^n—1 } ? 

(iii) an automaton A'n , such that 

(a) Mu £ In- An{u) = A'n{u ), and 

(b) EnJri is attracted to In+i. 
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We will now give specifications of (i)-(iii), and thereafter show that they are 
well-defined, and satisfy the above mentioned properties. 

(i) We define G as follows: 

kn = c“^(min([w„]...„ n c[7t:„])), 

where min denotes the minimum with respect to the order < on F* given 
by the enumeration wq < wi < W 2 < ■ ■ ■ ■ 

(ii) The map (i„ : N —>■ is defined for all z G N by 


dn{i) 


Wj ii i = kj for some j G {0,1 ,..., n — 1} , 
c{i) Hi G Kn- 


(iii) For we choose the automaton A' guaranteed to exist by Lemma 4 
where the lemma is invoked with A = An, I = In+i, and E = En- 

We note that ~o = T* x F*, and, by definition of it follows that 

En is finite and every C G En is regular language. (2) 

We prove that (i)-(iii) are well-defined, and that the following properties 

hold: 

(a) c[Kn\ nWn = 0 , 

(b) En is attracted to 

(c) dn is a bijection, and 

(d) kn is well-defined. 

We first show that, for every n G N, items (c) and (d) follow from (a) and (b). 

(c) We show that : N is bijective. Surjectivity of dn is immediate 

by definition of To show injectivity, assume there exist zi,Z 2 G N such 
that dn{ii) = dnii 2 )- From (a) and the definition of dn, it follows that 
either ii,i 2 G Kn or ii,i 2 G Kn- If Zi,i 2 G Kn, then there are ji,j 2 G 
{0,1,...,n — 1} such that ii = kj^ and 12 = kj^. Then it follows that 
Wj-^ = dn{ii) = dn(* 2 ) = Wj 2 , which can only be in case if ji = j 2 - Hence 
ii = kj^ = kj^ = 12 . If zi,Z 2 G Kn, then c(zi) = dn{ii) = d„(z 2 ) = c(z 2 )- By 
injectivity of c we have ii = Z 2 . 

(d) To see that kn is well-defined it suffices to show that fl c[iF„] is 

non-empty. By (b) we know that either n/„| = 00 , or C 

- For |[wn]~„ n 7„| = 00 , it follows that fl c[iF„]| = 00 . 

- For [wn]r..„ Q In, we get Wn G ([?^]...„ n /„). Since = c[iF„] U Wn, 

and Wn ^ Wn, (a) entails Wn G c[Kn], and hence H c\Kn\ yf 0. 
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We note that, indeed, (1) follows by the choice of kn- 

We now prove (a) and (b) by induction on n G N. For the base case we have: 

(a) c[Kq\ n Wo = 0 since Wq = 0, and 

(b) Eo = {F*} is attracted to Iq = c[N]. 

For the induction step, let n G N be arbitrary. We assume c[Kn\ H Wn = 
0 and that En is attracted to /„ (induction hypothesis). We first prove the 
implication 



( 3 ) 


If Wn G c[Kn], also Wn G fl c[iG„]). Since c[Kn] n Wn = 0 by induc¬ 

tion hypothesis, Wn is the smallest element in [wn]r^^ H c[Kn]. Hence = 
min([w„],..„ n c[Kn]) and c(fc„) = 

(a) By the induction hypothesis we have c[Kn] H Wn = 0, and so we have 
c[Kn+i] n Wn = 0. To see that c[Kn+i] n Wn+i = 0, it suffices to show 
Wn ^ c[iF„+i]. Assume, to derive a contradiction, that Wn G c[Ar„+i]. 
Then also Wn G c[Kn]. Hence, by (3), we have Wn = c{kn). This contradicts 
Wn G c[Ar„+i], since, by injectivity of c, we have c[Arri+i] = c[Kn]\{c{kn)} = 


c[Kn] \ {Wn}. 


(b) We have to prove that En+i is attracted to In+i- We first show that 


( 4 ) 


En is attracted to In+i- 


By injectivity of c we have In+i = c[Kn] \ {c(fc„)} U Wn U {w„}. Moreover, 
since by induction hypothesis c[Ar„]nW„ = 0, we have c[Ar„]\{c(fc„)}UlF„U 
{w„} = {c[Kn] u Wn) \ {c(fc„)} U {w„}. Hence /„+i = (/„ \ {c(fc„)}) U {w„}. 
Let C G En be arbitrary. By induction hypothesis we know that C is 
attracted to we distinguish the following two cases: 

- If IC n In\ = cx), then \C fl In+i \ = oo. Hence C is attracted to In+i- 

- Assume C C If c(fc„) ^ C, then clearly C C In+i- If c(fc„) G C, 

then C = [wn]~„ since c{kn) ~n Wn due to (1). From Wn € C C 

we obtain Wn G c[Ari]- Hence, by (3) we have Wn = c(fc„) and so 

— ^n+1 ■ 

This concludes the proof of (4). 

Recall that A'^ is the automaton A' obtained by invoking Lemma 4 with 
A = An, I = In+i, and E = En- Both requirements of the lemma are 
established above, in (2) and (4), so the automaton A'^ is well-defined. Let, 
moreover, E' be the resulting partition obtained in the lemma. Lemma 4 
guarantees that E' is attracted to In+i- Moreover, by definition of En+i, 
we have E' = En+i- Hence En+i is attracted to In+i- 
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We now establish that every natural number k will be picked as a kn even¬ 
tually. That is, for every fc G N there exists n G N such that kn = k. Let /c G N. 
For every n G N, the set c[Kn] is the image under c of all natural numbers that 
have not yet been picked as a ki for 0 < * < n — 1. Since wo,wi,... is an 
enumeration of F*, there exists n G N such that c{k) G Wn- Hence k ^ Kn since 
c[Kn] n Wn = 0 by property (a). Thus k will be picked eventually. 

We dehne the encoding d : N —>• F* by 

d(i) = lim dn{i) {i G N) 

n—^oo 

These limits are well-dehned since, for every i G N: there exists n G N with 
i = kn, and we have dm(i) = c{i) for all m < n and dm{i) = Wn for all m > n. 
From the above, it follows that the function d has the following property: 

d{kn) = Wn for all u G N . (5) 

We now show that d is indeed a bijection. Since by construction, every wo,wi,W 2 , 
occurs precisely once in the image of d, d is injective. Moreover, we have seen 
above, that the set {kg, k, k 2 , ■ ■.}, the domain of d, contains all natural numbers. 
Hence d is a bijection. 

Finally, we show that for all L C N, if c[L] is relatively regular in the image 
c[N], then d[L] is a regular language. Let L C N be such that c[L\ is relatively 
regular in c[N]. Then there exists a regular language R such that c[L\ = RC\ c[N]. 
Let TO G N be such that R = Lang(Hm). Then for all u G N, 

n G L -4=^ c(n) G Lang(Hm). (6) 

By the above construction we have for all w G Im 

w G Lang(H^) w G Lang(H„). 

By definition, coincides with c[N] for all but finitely many words. Hence for 
almost all n G N 


c(n) G Lang(H„) c(n) G Lang(H^). (7) 

By the definition of d, and property (1), we find: 

d(fc„) = Wn ^n c{kn) for all 71 G N. 

Due to ^n C for every n > m, we obtain that 

d{kn) c{kn) for all n > TO, 

and hence, since N={A:„:uGN} holds, that 

d{i) ~m c{i) for almost all i G N. 
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Hence for almost all n we have 


c(n) € Lang(H;,) 
c{n) € Lang(Hm) by (7) 

n € L by (6). 

As d is bijective, we obtain 

w e Lang(A^) <;=^ w G d{L) 

for almost all w GT* . Hence d{L) differs only by finitely many elements from a 
regular language and is consequently itself regular. □ 

The following proposition states that, under certain conditions, the bijective 
function g constructed in Lemma 6 is computable. Obviously, the injective 
function / that is lifted to g must be computable to start with. Moreover, 
we need to be able to decide for regular languages whether their intersection 
with the image of / is empty, finite or infinite. This enables us, in case of 
a finite intersection, to compute this intersection, and to decide whether the 
equivalence classes En are attracted to the image of / (and /„). This suffices to 
ensure computability of g constructed in the proof of Lemma 6. 

Proposition 7. Let A he a computable, countably infinite set and let f : A ^ T* 
be a computable injection such that for every regular language R (given by an 
automaton), emptiness and finiteness of the set i? n/[L*] is decidable. There 
exists a computable hijection g : A ^ V* such that for all L Q A, if f[L\ is 
relatively regular in the image f[A], then g[L] is a regular language. 

Proof. By close inspection of the proofs of Lemmas 4 and 6 it can be estab¬ 
lished that the construction of the encoding g from the encoding / preserves 
computability under the stated conditions and by choosing computable enumer¬ 
ations vo,vi,... and wo,wi,... oi A and L*, respectively. □ 

We are ready to state our main result. 

Theorem 8. Let S and L be finite alphabets, with |r| > 2. Let T C p(S*) be a 
countable set of formal languages. There exists a bijective function 5 : S* ^ T* 
such that for every L G C, the image g[L] is a regular language. 

Proof. By Lemmas 2 and 6. □ 

The following corollary justifies the title of this paper. 

Corollary 9. There exists a computable bijective function f : T,* ^ T* such 
that the image function /[_] is regularity preserving, but the preimage function 
f~^[-] is not. 


d(n) G Lang(A(„ 
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Without the requirement on the function / to be computable, we could 
prove the statement as follows: Let C C p(S*) be the (countable) set of all 
recursive languages over E. By Theorem 8 there exists a bijective mapping 
/ : E* —>• r* such that f[L\ is regular for all L G C. Then, clearly, /[_] is 
regularity preserving while /~^[-] is not. However, the function / obtained in 
this way is not computable. To obtain a computable function, we argue as 
follows. 

Proof of Corollary 9. Let E = {0,1}. We define a ‘balancedness’ function b : 
E* ^ E for all w S E* by b{w) = 1 if the word w contains an equal number of 
zeros and ones, and b{w) = 0 otherwise. Then we define / : E* —>■ E* by 

f{w) = b{w)w 

for every w G E*. Then we have: 

(i) For every regular language L C E*, we have that f[L\ is relatively regular 
in /[E*] (an automaton can simply ignore the first letter). 

(ii) However, the function /~^[-] is not preserving (relative) regularity. To see 
this, let X = {u; I b{w) = 1}. Clearly /“^[/[-E]] = X is not regular. But 
f[X] is relatively regular in /[E*], since f[X] consists precisely of those 
words in /[E*] that start with letter 1. 

We now invoke Proposition 7 for lifting / to a computable, bijective g. The 
conditions of the proposition are satisfied since / is computable, and the image 
/[E*] of / is a context-free language. The intersection of context-free languages 
with regular languages is context-free, and finiteness and emptiness are de¬ 
cidable. The proposition guarantees the existence of a computable, bijective 
function g : E* —>• E* such that for every L C E* that is relatively regular 
in /[E*] we have that g[L\ is regular. Then by (i) we have that g[J\ preserves 
regularity. From (ii) it follows that g[X] is regular while g~^[g{X)\ = X is not. 
Hence does not preserve regularity. □ 

We strengthen the statement of Corollary 9 by extending it to the preser¬ 
vation of membership in countable classes of languages that include the regular 
languages. For this, we use the following stipulation. For an arbitrary set S 
of languages over A, we say that a function F : p{A*) —>■ p{B*) preserves 
membership in S if, for all languages over A, L G S implies F{L) G S. 

Corollary 10. Let S C p(E*) be a countable set of languages that includes the 
regular languages. Then there exists a bijective function f : {0,1}* —>■ {0,1}* 
such that /[_] preserves membership in S, but f~^[-] does not. 

Proof. Let <S' = 5 U {X} where X is a language not in S. By Theorem 8 there 
exists a bijective function / : E* ^ F* such that f[L] is regular for all L G S'. 
Then, /[_] preserves membership in S while f~^[-]; note that f{X) is regular 
and hence f{X) G S, but f~^[f{X)] = X ^ S. □ 
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4 Consequences for Comparing 
Models of Computation 

It turns out that our main results have some remarkable consequences in the 
context of comparing computational models using concepts proposed by Boker 
and Dershowitz in a series of publications [3, 5, 4]. The authors summarize their 
goal as follows: 

“We seek a robust definition of relative power that does not it¬ 
self depend on the notion of computability. It should allow one 
to compare arbitrary models over arbitrary domains via a quasi¬ 
ordering that successfully captures the intuitive concept of compu¬ 
tational strength. [. ..]^” [5] 

This motivation leads them to specific choices of simple and liberal conditions 
on encodings. Encodings are typically used to translate between models of 
computation that act on different domains. So an encoding p : Di —^ D 2 
can facilitate the simulation of the input-output behavior of a machine Mi 
belonging to a model A4i with domain Di by a machine M 2 from a model M .2 
with domain D 2 : 


Di 


Ml 


Di 


D2 


M2 


-2 D2 


To prevent codings from participating too strongly in the simulation of a 
computation on a machine Mi through a computation on a machine M 2 (and 
thereby from substantially alleviating, for the simulating machine M 2 , the task 
that is solved by the simulated machine Mi), codings are usually required to be 
computable in some sense. Frequently, one of the following two restrictions are 
stipulated (see for example Rogers’ classic book [20, p.27,28]): 

(1) Codings must be ‘informally algorithmic’, ‘informally computable’, or ‘ef¬ 
fective’ in the sense that they can be carried out by an in principle mech¬ 
anizable procedure. 

(2) Codings are required to be computable with respect to a specific model, 
for example by a Turing machine. 

Boker and Dershowitz reject such prevalent stipulations: 

^This passage continues: “Eventually, we want to be able to prove statements like ‘analogue 
machines are strictly more powerful than digital devices’, even though the two models operate 
over domains of different cardinalities.” 
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“EfTectivity is a useful notion; however, it is unsuitable as a gen¬ 
eral power comparison notion. The first, informal approach is too 
vague, while the second can add computational power when dealing 
with subrecursive models and is inappropriate when dealing with 
non-recursive models.” [4] 

As a consequence, they go on to use classes of encodings that do not constrain 
(at least not explicitly) the cost that is necessary to compute an encoding. In 
particular, they define three concepts of comparison (see Definition 17 below) 
that are, respectively, based on:^ 

(i) encodings (injective functions) without any additional requirement; 

(ii) encodings that are ‘decent’ with respect to the simulating model AI 2 that 
shoulders the simulation, in the sense that A42 is able to recognize the 
image of the coding; 

(hi) bijective encodings. 

We will show that each of these concepts admits some quite counterintuitive 
consequences. At first these anomalies pertain only to decision models, the 
subclass of all models that only obtain ‘yes’/‘no’ as computation result. But it 
turns out these phenomena apply also to more broad classes of models. 

In order to formally state our results, we repeat here the basic definitions in 
[5, 4], and extend them by straightforward adaptations for decision models. 

By abstracting away from all intensional aspects of models of computation 
that concern mechanistic aspects of stepwise computation processes, Boker and 
Dershowitz define a model extensionally as an arbitrary set of (extensionally 
represented) partial functions over some domain. 

Definition 11 ([5, Def. 2.1]). A model of computation is a pair M = {D,T), 
where D is a set of elements, the domain of M, and if = {f \ f : D ^ DU{_L}} 
is a set of functions with _L ^ We write dom^i for the domain of A4. (We 
assume that _L is a fixed element not contained in the domain of any model.) 

We define decision models as models of computation consisting of total func¬ 
tions that yield a definite ‘yes’/‘no’ answer. 

Definition 12. A decision model (model of computation for decision models) is 
a model of computation M. — {D,tF), such that {0,1} C D, and f[D] C {0,1}, 
for all f € T. 

Now codings between domains of models are defined. 

Definition 13 ([5, Def. 2.2]). Let Di and D 2 be domains of models of computa¬ 
tion. A coding (from Di to D 2 ) is an injective function p : I?iU{_L} —^ D 2 U{_L} 
such that p{x) = _L if and only if x = _L, for all x € Di. 

■^Note that encodings of the notions (i) and (ii) here are more liberal than those in (1) and 
(2) above, and that therefore the use of such encodings does not address the concern raised 
in the preceding quotation regarding the addition of computational power when dealing with 
subrecursive models. 
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For codings p between decision models it could be desirable to demand that 
p(0) = 0 and p{l) = 1. We do not to take up this restriction, for a pragmatic 
reason connected to the definition of ‘simulation’ immediately below. If namely 
a non-constant function / in a decision model Mi is simulated via p by a 
function p in a decision model M 2 , then it follows that either p(0) = 0 and 
p(l) = 1, or p(0) = 1 and p(l) = 0. In both cases it can be said that decisions 
taken by / are faithfully modelled by corresponding decisions taken by g. 

Definition 14 ([5, Def. 2 . 3 ]). Let Mi = {Di,Fi) and M2 = {02,^2) be 
models of computation. Let p be a coding from Di to II 2 . We define: 

(i) For g G T 2 and / € Ju we say that g simulates f via piigop = pof 
holds, as in the following diagram: 


Di 


P 


D2 


f 


9 


Di 


P 


D2 


(ii) M2 simulates Mi via p, denoted by Mi M2, if for every f G Ti there 
is a p S such that g simulates / via p. 

The ‘decency’ requirement for codings mentioned before is defined as follows 
in [4]. There, Boker stresses that this requirement follows classic definitions of 
computable groups by Rice [18, p.298] and Rabin [17, p.343]. 

Definition 15 ([4, Def. 52]). Let Mi = (Di, J^i) and M2 = {D2, J-2) be models 
of computation. A coding p from Di to D2 is called decent with respect to M2 
if the image p\Di\ can be recognized by M2, in the sense that there is a total 
function g G T2 such that p{Di] = p[I? 2 ] and for all y G D2, we have g{y) = y 
if and only if p € p[Di]. 

We note that according to Definition 15 a coding p can be decent with respect 
to a decision model M2 only if p[Di] C {0,1}. Since p is injective, [Dij < 2 
follows, and so A4i can only be a rather trivial model. Therefore we adapt the 
notion of decency in an obvious way to accommodate decision models. 

Definition 16 . Let Mi = (Di,iFi} and M2 = {02,^2) be models of compu¬ 
tation. A coding p from Di to D2 is called decent* with respect to A42 if the 
image p[Di] can be recognized by M2, in the sense that there is a total func¬ 
tion g G T2 and an element d G D2 such that for all y G D2, we have g{y) = d 
if and only if p € p[Di]. 

With the concepts ‘model of computation’ and ‘simulation’ defined, Boker 
and Dershowitz introduce three comparison preorders for models, which are 
based on three classes of codings as mentioned above. In addition to the preorder 
induced by decent codings, we also define a variant preorder induced by decent* 
codings. 
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Definition 17 ([4, Def. 52]). Let A4i = and A42 = {D 2 , ^ 2 ) be models 

of computation. We define: 

(i) At 2 is at least as powerful as A4i, denoted by 


if -^1 -^2 for some p. 

(ii) M 2 is at least as powerful as Mi via a decent coding, which we denote by 

M 1 ^decent M 2 , 

if -^1 -^2 for some decent coding p with respect to M 2 - M 2 is at 

least as powerful as Mi via a decent* coding, which we denote by 

Mil ^decent* At 2 ; 

if Mil <p M 2 for some decent* coding p with respect to Al 2 . 

(hi) A 42 is at least as powerful as Mi via a bijective coding, which we denote 
by 


Mil ^bijective At 2 , 


if Mil At 2 for some bijective coding p. 


With these definitions in place, we are now able to state, and prove, our 
results concerning the comparison of decision models. For this we denote, for 
an alphabet F with {0, 1} C F, by DFA(r) = (F*,!?) with 


l?={/:r*^{ 0 ,l} 


3M DFA Vw e F* 

( f{w) = 1 M accepts w) 


the decision model consisting of all functions / : F* —>■ { 0 , 1 } that describe 
the acceptance/non-acceptance behavior of a DFA with input alphabet F. The 
model TMD(r) over input alphabet F of Turing-machine deciders is defined 
analogously. 

The proposition below is an easy consequence of Lemma 2. 


Proposition 18 . Let E, F he alphabets, where {0,1} C F. Then for every 
countable decision model M with domain E* 


At < DFA(r), (8) 

holds, that is, deterministic finite state automata with input alphabet F are at 
least as powerful as M. 


20 



Proof. Every decision model A4 = with domain E* and with count¬ 

able set P' of computed functions corresponds to the countable set £m = 
{Lf I / e P} of languages that are defined, for / e as L/ = {lu G E* | /(lu) = 1}. 

By Lemma 2 there exists an injective function p : E* —>■ L* such that p[Lf\ is 
relatively regular in p[E*], for all f G P. Now it is straightforward to verify that 
p is a coding between the domains of the models A4 and DFA(r) that facilitates 
the simulation of every / G by a function g : L* —>• {0,1} that denotes the 
acceptance/non-acceptance behavior of a deterministic finite-state automaton 
with input alphabet L. This shows ( 8 ). □ 

Proposition 19 . Let T be an alphabet with {0,1} C T. Then there is a coding 
p : r* —r* such that TMD(r) <p DFA(r) holds. But any such a coding p 
cannot be computable. 

Proof. The main statement follows from Proposition 18. That p cannot be com¬ 
putable can be seen as follows. Suppose that p is computable. Let Aq,Ai,... and 
wo,wi,... be recursive enumerations of all finite automata and words over E. 
Then the language L = {wn | n G N, p{wn) 4- Lang(A„)} is Turing computable, 
but p[L] is not regular. □ 

This statement can be strengthened to a bijective, and therefore (see the 
proof) also decent*, simulation with finite-state automata by using Lemma 6 
and Theorem 8 . 

Corollary 20 . Let E and T be alphabets, where {0,1} C T. Then for every 
countable decision model A4 with domain E* the following two statements hold: 

(i) M <bijective DFA(r), 

(a) M <decenf DFA(r). 

That is, deterministic finite-state automata with input alphabet T are at least as 
powerful as the model A4, both via bijective and via decent* codings. 

Proof. Statement (i) follows by an argumentation analogous to the proof of 
Proposition 18 in which the use of Lemma 2 is replaced by an appeal to our 
main theorem, Theorem 8 . 

Statement (ii) follows directly from statement (i): For bijective codings p : 

E* —>■ P*, the image of p coincides with P*, which is trivially recognizable by a 
finite-state automaton. □ 

Remark 21 . As mentioned above (just before Definition 16), decent codings 
in the sense of [4] do not form a sensible notion for decision models. However, 
for every countable decision model A4 and every alphabet P with { 0 , 1 } C P it 
holds that 


M ^decent DFAid(r) 
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where DFAid(r) is the extension of DFAid(r) by adding the identity function 

id : r* ^ r*. 

Sequential finite-state transducers FST(r) (see e.g. [21]) over alphabet F 
form a natural computational model that extends DFAid(r). Thus for an alpha¬ 
bet r with {0,1} C r we also obtain the very counterintuitive result: 

M ^decent FST(r), 

that is, sequential finite-state transducers are at least as strong via a decent 
coding as every countable decision model. (Here we consider finite-state trans¬ 
ducers that are able to recognize the end of a word.) In particular, every Turing- 
machine decider can be simulated by a sequential finite state transducer via a 
decent coding. 

Remark 22 . These results raise the question, whether these anomalies only 
concern decision models. In particular, one may wonder whether the comparison 
of computational models avoids counterintuitive results when additional require¬ 
ments are imposed on the models that are compared. A candidate requirement 
would be to enforce that the output of the models must have an infinite range. 
Or, even stronger, we could require the following property: A class of models 
A4 = {D, iF) is image-complete if for every non-empty computable set I ^ D 
there exists f & F such that f[D] = I. 

Let T = (S*, consist of all Turing machines such that for every f G F 
we have there exists a finite set Ly C E* and for all x G E* we have f{x) € 
{x}UL/U{_L}. Thus functions f gT map words either to themselves or into a 
finite set that may depend on /. The class 7” is a natural model because it can 
be implemented by a recursively enumerable set of Turing machines^. Note that 
7~ is a strict extension of Turing-machine deciders and it is an image-complete 
model. This can be seen as follows: Let / C E* be any non-empty computable 
set. Let i G I and define the function / : E* —)> E* for all x G E* by: /(x) = x 
if X G / and /(x) = i, otherwise. Then /[E*] = I and / G T. 

Then the model T can be simulated by two-way sequential, finite-state trans¬ 
ducers [7] via decent codings: 

^decent 2- FST(r). 

We have already argued that the Turing-machine deciders can be simulated by 
FST(r). This can easily be generalized to a finite number of output words L 
(finite-state transducers can output words instead of only symbols). Now we 
assume that one symbol w G L symbolizes the identity output; then the two-way 
finite-state transducer, instead of producing this output word, can walk back to 
the beginning of the input and reproduce the input word as output word. 

Our results suggest that there are definite limitations to the concepts of 
power comparison for models of computation by Boker and Dershowitz. These 

®The idea is to enumerate all Turing machines and finite sets Lf, and adapt the machines 
to check on termination whether the output is in Land otherwise make sure that the output 
is equal to the input. In this way, we obtain all machines that are necessary to implement F. 
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concepts have an ‘absolute’ flavor insofar as they do not formulate any explicit 
constraints on the computability of encodings used for simulations. The coun¬ 
terintuitive consequences pertain primarily to decision models (yet this is a 
blurry concept, see Remark 22), and do not extend to models that include all 
partial-recursive functions (see Corollary 24 below). Yet they demonstrate that 
these comparison concepts lack the desired robustness. 

We note that our results are not the first indications of anomalies. In [5, 
Example 5.1] Boker and Dershowitz show that Turing-machine deciders are 
not a complete model of computation, in the sense that this model can be 
strictly extended to incorporate a non-recursive set. Our results strengthen 
this example naturally in the following three ways: (i) to bijective, and decent 
encodings, (ii) to use finite automata instead of Turing-machine deciders, and 
(iii) to arbitrary countable models as extensions. This is because, as we have 
shown, finite automata can be extended, via decent codings, to any countable 
decision model, and consequently the same holds for Turing-machine deciders. 
Hence, even decent encodings facilitate the simulation of all Turing-machine 
deciders by finite automata, more precisely, by finite-state transducers. 

The following theorem is an easy consequence of the concepts developed by 
Shapiro in [25]. He calls a number representation r : N —>■ E* acceptable if it 
is bijective, and if the successor function succ : N ^ N can be simulated by a 
Turing machine on the representations, that is, if the lifting succ’’ : E* ^ E* of 
succ with the property: 

succ’’(r(n)) = r(succ(n)), for all n S N, 

is Turing computable. 

Theorem 23. A bijective encoding / : E* ^ T* zs computable if and only if 
there is an acceptable number representation r such that f o r is acceptable as 
well. 

This theorem implies that Corollary 20 (i) does not generalize to models 
that compute the partial-recursive functions. 

Corollary 24. There is no bijective encoding p : E* ^ E* such that TM(E) 
^-FST(E) holds, where TM(E) is the model of Turing machines over alphabet 

E. 


While certainly more investigation is needed, we also interpret our results as 
follows. For comparing the computational power of models of computation over 
different domains, it is crucial to make clear how computational power should 
be measured for the purpose at hand. After having settled on a reasonable 
measure, this measure can then be used to constrain the computational power 
of admissible codings that may act as a trustworthy intermediary between the 
models. 
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5 Consequences for Generalized Automaticity 

Finite-state automata can be used to generate infinite sequences, see [1]. This is 
usually done using the standard base-fc representation {n)k € {0,1 ,..., fc — 1}* 
of the natural numbers n S N. A sequence a G is called k-automatic if, for 
some deterministic finite-state automaton A with output (DFAO, see Section 2) 
we have (j{n) = A{{n)k) for all n G N. 

This concept has been generalized in several ways, where different number 
representations are fed to the automaton, see, e.g., [24, 19, 6]. This motivates 
the study of automaticity with respect to arbitrary number representations, 
which is part of work in progress of the present authors with Clemens Kupke, 
Larry Moss, and Jan Rutten. 

Definition 25. Let c : N F* be a number representation. A sequence cr G 
over an alphabet A is c-automatic if there exists a DFAO A such that a{n) = 
A{c{n)) for all n G N. 

Lemma 26. Let c : N —>■ F* be a number representation. A sequence a G A^ 
is c-automatie if and only if for every a G A the ‘fiber’ {c{n) \ <j(n) = a} is 
relatively regular in c[N]. 

Proof. Along the lines of Lemma 5.2.6 in [1]. □ 

Corollary 27. For every injective function c : N —>■ F* there is a bijective 
function d : N —F* such that for every a G A^ we have: if a is c-automatic, 
then a is also d-automatic. 

Proof. Let c : N —)• F* be an injection. Let d : N —)• F* be the bijective function 
obtained from c by Lemma 6. Let a G A^ be c-automatic. We show that a 
is d-automatic by an application of Lemma 26. Let o G A. Then the fiber 
{c(n) I cr{n) = a} is relatively regular in c[N] by Lemma 26. By Lemma 6 we 
obtain that the fiber {d{n) \ <j(n) = a} is regular (and consequently relatively 
regular in d[N] = F*). Hence, by Lemma 26, a is d-automatic. □ 

The following proposition shows that the implication in Corollary 27 cannot 
be strengthened to equivalence of c- and d-automaticity: not for all injective c 
there is a bijective d so that c-automaticity and d-automaticity coincide. 

Proposition 28. Define the representation c : N {0,1}* by c(n) = 0"^ Then 
we have: 

(i) For every sequence a G A^, a is c-automatic if and only if a is ultimately 
constant. 

(ii) For every bijection d : N —>■ {0,1}*, there is a d-automatic sequence that 
is not ultimately constant. 

Proof. We first prove the two implications of (i). 
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(=>) Let a € be c-automatic. That is, for some automaton A, a(n) = 
A(c(n)) for all n G N. As there are finitely many states in A, there exists 
no,i G N such that i > 0 and d{qo,0'^) = for all n > no, 

where 6 is the transition function of A and qg its starting state. Let 
Too G N be the smallest integer such that too! > ng + £. Then we have 
(5(go 7 O’"') = 5(go7O’"“0 for all to > mg. The reason is that too! > no and, 
for all TO > Too, to! is a multiple of I (so that to! = mg! + k£ for some 
k G N). 

(<^) Let (T G be ultimately constant, that is, there exists no G N such that 
cr(n) = a{ng) for all n > ng. Let to = no!. We define an automaton A with 
states qg,qi,... ,qm, and transition function S defined by S{qi, 0) = 
for all i G {0,1,..., TO — 1} and S{qm,0) = q-m- For the output of qt 
(0 < i < to) we take a{j) if f = j!. The output of the other states is 
irrelevant. Clearly we now have (j(n) = A{c{n)) for all n G N, and so a is 
c-automatic. 

For (ii), let d : N —> {0}* be a bijective encoding. Let A be an automaton 
such that A(0^") = 0 and A(0^"+^) = 1 for all n G N, and define a G {0, 
by cr(n) = A{d{n)) for n G N. Note that a is d-automatic by definition. As d 
is bijective, there are infinitely many to G N such that d(m) is of the form 0^", 
and there are infinitely many to G N such that d(m) is of the form 0^"+^. Hence 
cr is not ultimately constant. □ 

The following corollaries are reformulations of our main result. Theorem 8 , 
for recognizability and, more generally, automaticity, respectively. 

Corollary 29. For every countable class C of sets of natural numbers there is 
a bijective function r : N —^ S* such that every S G C is r-recognizable (i.e., 
there is a finite automaton deciding membership n G S on the input ofr{n)). 

Corollary 30. For every countable class S C A^ of infinite sequences over a 
finite alphabet A, there exists a bijective number representation d : N —>■ F* such 
that every a € S is d-automatic. 

6 Conclusion and Further Questions 

Our main result. Theorem 8 , states that for every countable class C C p(S*) 
of languages over a finite alphabet S, and for every alphabet F with more than 
two symbols, there exists a bijective encoding / : S* —>• T* such that for every 
language L G £ its image f[L] under / is regular. 

Furthermore we have shown that this result has a number of noteworthy con¬ 
sequences in language theory for regularity preserving functions, in computabil¬ 
ity theory for a concept for comparing the power of models of computation, and 
in the theory of automatic sequences for a generalization of this concept with 
respect to arbitrary number representations: 
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(A) There exists a computable bijective function / : E* ^ T* such that the 
image function /[_] of / is regularity preserving, but the preimage function 

is not (Corollary 9). 

(B) In the sense of [5], finite-state automata are as powerful as any countable 
decision model (e.g., Turing-machine deciders) (Proposition 18). This even 
holds for the strongest notion of comparison in [5], namely that with respect 
to bijective encodings (Corollary 20). Similar counterintuitive consequences 
also affect computational models beyond decision models. 

(C) For every countable class C of sets of natural numbers there is a bijective 
number representation r : N ^ E* such that every set S G C is r-recogniz- 
able (i.e., there is a finite automaton deciding membership n G S on the 
input of r(n)) (Corollary 29). 

More generally, for every countable class S C of infinite sequences over a 
finite alphabet A, there exists a bijective number representation d : N —)• T* 
such that every a G S is d-automatic (Corollary 30). 

These results also answer the questions in Section 1 concerning the hierar¬ 
chy of number representations. From (A) it follows that the hierarchy is proper: 
there are bijective representations ri and r 2 such that ri subsumes r 2 , but not 
vice versa. From (C) it follows that every countable class C C p(N) of languages 
is contained in the countable class of all r-recognizable languages, for some rep¬ 
resentation r. Moreover, (C) implies that every injective number representation 
is subsumed by a bijective number representation, and that no representation 
subsumes all others, since the class of r-recognizable sets of natural numbers is 
always countable. 

We conclude with two questions: 

- How far can computable bijective encodings / : E* ^ F* extend the 
class of recognizable languages, that is, what classes £/ = {L C E* | 
f[L] is a regular language} can we obtain for a computable bijective /? 
For example, is there a computable (bijective) encoding that makes pre¬ 
cisely all context-free languages recognizable? 

- Rigo [19] describes a class of number representations that characterizes 
the morphic sequences. Our results entail the existence of a bijective 
representation r : N —^ E* such that every morphic sequence is r-automa- 
tic. Is there a computable bijective representation r such that precisely 
the morphic sequences are r-automatic? 
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